CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies

Authors

  • Jun Chen
  • Allan C. Just
  • Joel Schwartz
  • Lifang Hou
  • Nadereh Jafari
  • Zhifu Sun
  • Jean-Pierre A. Kocher
  • Andrea A. Baccarelli
  • Xihong Lin
Abstract

SUMMARY: The development of the Infinium HumanMethylation450 BeadChip enables epigenome-wide association studies at a reduced cost. One observation from 450K data is that many of the CpG sites the BeadChip interrogates have very large measurement errors. Including these noisy CpGs decreases the statistical power to detect relevant associations because of the multiple testing correction. We propose using the intra-class correlation coefficient (ICC), which characterizes the relative contribution of biological variability to total variability, to filter CpGs when technical replicates are available. We estimate the ICC from a linear mixed effects model fitted to all samples pooled together rather than to the technical replicates only. An ultra-fast algorithm has been developed to address the computational complexity, and CpG filtering can be completed in minutes on a desktop computer for a 450K data set of over 1000 samples. Our method is very flexible and can accommodate any replicate design. Simulations and a real data application demonstrate that our whole-sample ICC method performs better than replicate-sample ICC or variance-based methods.

AVAILABILITY AND IMPLEMENTATION: CpGFilter is implemented in R and is publicly available from CRAN as the R package 'CpGFilter'.

CONTACT: [email protected] or [email protected]

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
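
To make the ICC in the summary concrete, here is a minimal Python sketch for a single CpG. It is not the CpGFilter algorithm: the abstract describes a linear mixed effects model with a fast dedicated fitting procedure, whereas this sketch swaps in the classical one-way ANOVA (method-of-moments) estimator for unbalanced groups. The function name `icc_anova` and the input layout (one list of replicate measurements per biological sample) are assumptions made for illustration. Samples measured only once still enter the between-sample variance, which loosely mirrors the idea of pooling all samples.

```python
def icc_anova(groups):
    """Estimate ICC = var_bio / (var_bio + var_tech) for one CpG.

    groups: list of lists; each inner list holds the measurements of one
    biological sample (length 1 for samples without technical replicates).
    This is a method-of-moments sketch, NOT the CpGFilter implementation.
    """
    k = len(groups)
    sizes = [len(g) for g in groups]
    n_total = sum(sizes)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two samples and one replicated sample")

    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Between-sample and within-sample (technical) sums of squares.
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m in zip(sizes, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)

    # Effective group size for unbalanced designs.
    n0 = (n_total - sum(n * n for n in sizes) / n_total) / (k - 1)

    var_tech = ms_within
    var_bio = max((ms_between - ms_within) / n0, 0.0)  # truncate at zero

    total = var_bio + var_tech
    return var_bio / total if total > 0 else 0.0
```

A filter would then keep only the CpGs whose estimated ICC exceeds a chosen cutoff. The cutoff is a user decision and is not specified in the abstract.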

Related articles

Characteristics of DNA methylation and gene expression in regulatory features on the Infinium 450k Beadchip

Understanding the relationship between variations in DNA methylation and gene expression has been challenging. Evidence suggests the function of DNA methylation may vary with genomic context, and few consistent rules linking methylation to expression have been noted. For array-based studies, the content of current DNA methylation array platforms provide broad coverage of the genome but target o...

Trend in DNA Methylation of Human Blood

Background: Between- and within-person variation in DNA methylation levels are important parameters to be considered in epigenome-wide association studies. Temporal change is one source of within-person variation in DNA methylation that has been linked to aging and disease. Methods: We analyzed CpG-site–specific intraindividual variation and short-term temporal trend in leukocyte DNA methylation ...

An epigenome-wide association meta-analysis of prenatal maternal stress in neonates: A model approach for replication

Prenatal maternal stress exposure has been associated with neonatal differential DNA methylation. However, the available evidence in humans is largely based on candidate gene methylation studies, where only a few CpG sites were evaluated. The aim of this study was to examine the association between prenatal exposure to maternal stress and offspring genome-wide cord blood methylation using diffe...

A Comparative Study of Five Association Tests Based on CpG Set for Epigenome-Wide Association Studies

An epigenome-wide association study (EWAS) is a large-scale study of human disease-associated epigenetic variation, specifically variation in DNA methylation. High throughput technologies enable simultaneous epigenetic profiling of DNA methylation at hundreds of thousands of CpGs across the genome. The clustering of correlated DNA methylation at CpGs is reportedly similar to that of linkage-dis...

Grasping nettles: cellular heterogeneity and other confounders in epigenome-wide association studies

Platform technologies for measurement of CpG methylation at multiple loci across the genome have made ambitious epigenome-wide association studies affordable and practicable. In contrast to genetic studies, which estimate the effects of structural changes in DNA, and transcriptomic studies, which measure genomic outputs, epigenetic studies can access states of regulation of genome function in p...

Journal:
  • Bioinformatics

Volume 32, Issue 3

Pages: -

Publication date: 2016